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SUMMARY 


A  two-process  model  o£  pattern  discrimination  was 
developed  to  describe  how  tonal  sequences  are  processed,  stored, 
and  discriminated  by  the  human  auditory  system.  In  this  model, 
the  comparison  of  auditory  stimuli  involves  processing  in  two 
separate  concurrent  modes,  with  the  inode  contributing  the  least 
variance  having  the  dominant  effect  on  performance.  In  the 
'sensory  trace'  mode,  the  subject  compares  a  trace  of  the 
auditory  sensation  to  the  corresponding  trace  of  a  subsequent 
stimulus.  The  trace  is  assumed  to  decay  with  time,  but  be 
independent  of  the  number  of  possible  stimuli.  In  the  'context 
coding'  mode,  stimuli  are  d iscr imi nated  by  comparing  encoded 
representations,  each  representation  consisting  of  a  comparison 
between  the  given  stimulus  and  the  overall  context  of  other 
stimuli  heard.  The  quality  of  the  encoded  representation  is 
assumed  to  depend  on  the  size  of  the  auditory  context. 

Thl3  model  was  applied  to  tasks  in  which  a  subject  compared 
two  sequences  of  tones  and  judged  whether  the  frequency  patterns 
were  the  same  or  different.  Three  aspects  of  the  sequences  were 
manipulated:  (a)  the  temporal  variability  of  the  sequences,  (b) 
the  correlation  between  the  temporal  envelopes  of  each  pair  of 
sequences  presented  on  a  trial,  and  (3)  the  time  interval  between 
the  two  sequences.  In  the  'correlated'  condition,  the  two 
sequences  on  a  trial  had  identical  temporal  patterns,  while  in 
the  'uncorrelated'  condition,  all  sequence  temporal  patterns  were 
generated  independently.  In  order  to  study  the  properties  of  the 
trace  mechanism,  various  temporal  and  spectral  transformations 
were  made  to  the  second  sequence  of  each  pair. 


Performance  in  the  correlated  condition  was  Independent  of 
temporal  variability,  but  dropped  with  increases  in  the  time 
interval  between  sequences.  Performance  in  the  uncorrelated 
condition  decreased  with  increased  temporal  variability,  and  was 
independent  of  the  length  of  the  i riter -sequence- i nterval .  These 
results  support  the  assumptions  of  the  trace/context  theory;  good 
fits  were  obtained  between  data  from  human  observers  and  the 
predictions  of  the  two-process  model.  Further,  performance  in  the 
correlated  condition  was  remarkably  insensitive  to  certain  types 
of  transformations  made  to  the  second  sequence,  suggesting  that 
(a)  sensory  traces  ran  be  generated  for  separate  aspects  of  a 
sensory  stimulus,  or  (b)  traces  can  be  temporally  transformed  or 
"scaled"  for  comparison  with  other  stimuli. 

An  analysis  of  the  underlying  pattern  discrimination 
mechanism  was  done  by  evaluating  two  computational  models  of 
sequence  comparison:  a  weighted  average  algorithm  and  an  optimal 
sequence  matching  algorithm.  The  overall  pattern  of  performance 
for  both  models  was  similar  to  that  of  the  human  subjects,  but 
tr ial -by-tr lal  correlations  between  the  human  and  algorithm 
responses  was  low,  suggesting  that  neither  algorithm  provided  an 
accurate  description  of  the  human's  discrimination  mechanism. 

Some  general  assumptions  of  the  signal  detection  model  were 


evaluated  in  experiments  with  auditory  tonal  sequences  and  visual 
displays  generated  by  similar  statistical  rules.  These 
assumptions  concern  the  manner  in  which  the  observer  processes 
information  from  individual  elements  of  the  auditory  sequence  or 
visual  array.  The  experimental  results  were  consistent  with 
expectations:  (1)  In  sequential  auditory  displays,  the  first 
arriving  (primacy)  and  last  arriving  (recency)  elements  are 
emphasized;  (2)  with  visual  displays  the  fixated  region  is 
emphasized,  the  effective  size  of  the  region  depending  on  the 
element  coding  and  format. 
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1.  INTRODUCTION  AND  PURPOSE 


A  "simple"  sound,  such  as  o.  br let  pure  tone,  can  be  made 
into  a  "complex"  sound  by  concatenating  the  "simple"  sounds  into 
sequences.  Our  goal  is  to  understand  how  such  tonal  patterns 
are  processed  by  the  human  auditory  system.  First,  we  extend  an 
existing  model  of  auditory  signal  discrimination  to  the  complex 
sequence  discrimination  task.  Second,  we  test  some  specific 
comparison  algorithms  which  may  form  the  basis  Lor  the  sequence 
pattern  comparison  operation.  Third,  we  extend  the  general 
discrimination  model  to  the  case  where  linear  transformations  in 
duration  have  been  applied  to  the  tonal  patterns,  and  we  clarify 
some  special  properties  of  auditory  trace  memory.  Finally,  we 
summarize  an  applications  of  the  detection  theory  model  to  the 
analysis  of  multiple  element  auditory  and  visual  displays. 

1.1  The  Basic  Experimental  Paradigm 

Many  Investigators  have  tided  that  varying  the  parameters  of 
a  tonal  sequence  can  have  a  large  effect  on  performance  in  a 
discrimination  or  detection  task.  For  example,  Watson  and  his 
colleagues  have  reported  a  number  of  studies  in  which  a  subject 
had  to  detect  small  differences  between  tonal  sequences  whose 
parameters  varied  across  tire  experimental  trials.  This  type  of 
trial-by-trial  variation  produced  large  changes  in  the  subject's 
ability  to  detect  small  changes  in  the  frequency,  duration,  or 
intensity  of  one  component  of  the  tonal  sequence  (Watson,  Kelly 
and  Wroton,  1976;  Spiegel,  and  Watson,  1981;  and  Leek  and  Watson, 
1984).  Watson  (1987)  pointed  cut  that  these  effects  can  be  much 
larger  than  those  obtained  from  single  tone  ( 2 1 F C )  discrimination 
tasks  in  which  the  tone  parameters  vary  over  trials. 

We  reported  similar  effects  in  an  experiment  which  tested 
the  discrimination  of  temporal  jitter  in  tonal  sequences  (Sorkiri, 
Boggs  and  Brady,  1982).  Subjects  in  that  experiment  were 
presented  with  a  pair  of  tone  sequences  composed  of.  high  and  low 
frequency  tone  bursts.  The  subject's  task  was  to  discriminate 
differences  in  the  temporal  envelopes  of  the  two  sequences  and 
Ignore  differences  in  the  frequency  pattern  of  the  sequences.  We 
were  Interested  in  how  temporal  discrimination  depended  on  the 
pattern  of  tone  frequencies.  When  the  frequency  patterns  were 
fixed  over  trials,  performance  was  quite  good,  and  was  described 
well  by  a  simple  model  of  temporal  discrimination.  When  the 
frequency  pattern  for  each  sequence  was  varied  randomly  over 
trials,  but  fixed  for  the  pair  of  sequencers  within  a  trial, 
performance  was  poor.  When  the  binary  pattern  was  random  both 
across  and  within  trials,  performance  was  even  worse. 

The  present  experiments  continue  our  examination  of  the 
effects  of  spectral  and  temporal  variation  on  sequence 
discrimination.  In  the  current  experiments,  the  subject  must 
discriminate  differences  between  the  frequency  patterns  of  two 
tonal  sequences  and  ignore  differences  In  the  temporal  envelopes 
of  the  sequences.  We  were  Interested  In  the  effect  of  variations 
In  the  temporal  envelope  of  the  sequences  on  the  subject's 


ability  to  discriminate  different  patterns  of  tone  frequency. 

The  subject's  ability  to  discriminate  between  the  sequence 
frequency  patterns  was  evaluated  with  a  same/different  task.  on 
each  trial,  subjects  were  presented  with  a  pair  of  sequences  arid 
reported  whether  the  frequency  pattern  In  the  two  sequences  was 
the  same  or  different.  On  approximate ly  half  of  the  trials,  the 
two  sequences  had  different  frequency  patterns;  on  approximately 
half  of  the  trials,  they  had  the  same  frequency  sequence. 

Subjects  were  instructed  to  ignore  any  variations  in  the  temporal 
envelope  of  the  sequences,  such  as  jitter  in  the  tone  onsets, 
durations,  or  inter -tone  gaps. 

An  Important  aspect  of  the  experimental  paradigm  was  the 
manner  in  which  the  temporal  envelopes  of  the.  sequences  were 
varied.  In  one  condition,  the  cot  related  condition,  the  temporal 
envelopes  were  varied  across  trials  but.  were  identical  within 
trials,  e.a.  the  pair  of  sequences  within  a  trial  had  identical 
temporal  envelopes.  In  a  second  condition,  the  uncorrelated 
condition,  the  temporal  envelopes  were  varied  both  across  and 
within  experimental  trials.  Tims,  in  th<-  uncor related  condition, 
the  pair  of  tone  sequences  within  a  trial  had  d  l f  f  ••rent  temporal 
patterns . 
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H(j  1.  Example*  of  the  pairs  o\  sequence**  present  on  trials i»fe.».f  eninenl  1 
1  he  first  pair  ol  sequences  has  diffeicnt  patterns  of  high-  and  Knx-frequcno 
tores  but  the  same  pattern  of  iniertone  gaps.  T  lie  set  mj  pair  of  sequences 
h.is  the  same  frequency  pattern  hut  different  temporal  patterns 


These  conditions  are  illustrated  in  Figure  1,  which  shows 
some  of  the  tonal  sequences  possible  on  trials  of  a  typical 
experiment.  These  tonal  sequence;;  were  composed  of  tones  of  one 
of  two  frequencies:  a  high  ( II )  frequency,  or  a  low  (L)  frequency. 
In  the  first  example  the  frequency  sequences  are  different, 
therefore  the  correct  response  is  "different".  This  example 
illustrates  the  correlated  condition,  in  which  the  two  sequences 
have  identical  temporal  envelopes.  The  second  example 
illustrates  a  pair  of  sequences  in  the  uncorrelated  condition. 

The  frequency  pattern  of  these  two  sequences  is  the  same,  but  the 
temporal  envelopes  are  different;  t.he  correct  response  is  now 
"same" . 


Suppose  that  t.he  observer  has  available  a  memory  trace  of 
each  Input  sequence,  and  that  this  trace  is  a  close 


2 


representation  of  the  acoustic  input.  In  the  correlated 
condition,  the  memory  trace  of  the  two  sequences  can  be  compared 
directly,  since  any  difference  between  the  traces  of  the  two 
sequences  will  be  due  to  differences  in  their  respective 
frequency  patterns.  This  assumes  that,  the  trace  will  not  be 
degraded  during  the  time  lnterv.il  it  must,  be  held  prior  to  the 
comparison  operation.  In  the  uncorrelated  condition  the  two 
input  sequences  do  not  have  the  same  temporal  envelope,  and  a 
direct  comparison  of  the  two  memory  traces  is  not  appropriate. 

In  the  uncorrelated  condition,  a  direct  comparison  of  the  traces 
of  the  two  inputs  may  confound  differences  in  the  frequency 
patterns  of  the  sequences  with  differences  in  their  temporal 
envelopes.  In  the  uncorrelated  condition,  it  will  be  necessary 
for  the  observer  to  first  transform  or  encode  the  input 
sequences,  in  order  to  remove  those  aspects  of  the  sequences  that 
are  due  to  differences  in  the  temporal  envelopes. 

2.  SEQUENCE  DISCRIMINATION  THEORY 

Tanner  (1961)  proposed  a  general  framework  for  categorizing 
the  memory  requirements  of  a  variety  of  single  and  two  -  inter  v.*  i 
psychophysical  tasks;  this  framework  has.  been  extended  and 
quantified  by  a  number  of  workers  including  Gorkin  (1962)  in  a 
study  of  the  same/different  task,  and  Macmillan,  Kaplan,  and 
Creelman  (1977)  in  a  study  of  categorical  perception.  Tanner's 
model  included  a  short-term  decaying  memory  for  t.he  acoustic 
input  to  tile  system  plus  various  interference  and  long-term 
memory  factors.  The  trace  decay  component  of  the  Tanner  model 
was  extended  by  Kinchla  and  Smyzer  (1967)  and  later  incorporated 
as  the  trace  component  of  t. he  trace -context  model  developed  by 
Durlach  and  his  colleagues  (e.g.  Durlach  and  Braida,  1969; 
Berliner  and  Durlach,  1973).  Tn  Gorkin  (1994),  we  proposed  that 
the  Durlach  and  Braida  dual  mode  model  could  be  extended  to  the 
sequence  discrimination  task. 

According  to  the  Dur lach-hr  » Ida  model,  ,t  subject  can  employ 
two  different  processing  modes  in  a  discrimination  task:  a  trace 
mode  and  a  context  mode.  In  the  trace  mode,  the  comparison 
operation  is  performed  on  internal  memory  traces  of  the  input 
signals.  The  trace  Is  a  direct  representation  of  the  acoustic- 
input,  e.g.  a  precategor ica 1  replica,  and  the  trace  decays  or 
accumulates  noise  over  time.  Thus,  performance  in  the  trace 
mode  will  deteriorate  as  the  time  interval  between  the  two  inputs 
to  be  discriminated  is  increased.  In  the  context  mode,  each 
input  is  first  categorized  into  one  of  a  defined  set  of  codes. 

The  comparison  operation  is  then  performed  on  these  encoded 
representations.  Once  encoded,  the  internal  data  are  not  subject 
to  degradation  over  time.  However,  there  is  a  context  noise 
present  which  is  a  function  of  the  difficulty  of  the  encoding 
process.  Task  variables  such  as  t.he  stimulus  range  are  assumed 
to  Increase  this  context  noise.  Performance  In  a  given  task 
will  usually  involve  trade  o(ts  in  the  internal  noise  associated 
with  each  mode  of  operation. 

These  considerations  are  incorporated  into  the  Durlach  and 
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Braida  model  In  the  following  manner: 
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sensory,  context,  and  trace  aspects  of  tin  took,  i  -espoot  i  vc  ly . 

The  sensory  variance  term  refers  to  the  noise  from  Internal 
sources  other  than  those  associated  with  <  -needing  and  storing  the 
sequence  information.  It  can  he  seen  from  an  examination  of 
equation  1  that  the  trace  and  context  variances  will  operate 
together  in  determining  which  process,  or  whether  either 
process,  will  have  an  effect  on  performance.  For  example,  if.  the 
variance  associated  with  one  process  is  much  smaller  than  the 
variance  associated  with  the  other  process,  the  process  with  the 
smaller  variance  will  have  the  dominant  effect  on  performance. 

In  the  sequence  discrimination  tasks,  of  Ui<*  present  study, 
we  assume  that  flit-  sensory  variance  Is  very  small  relative  to  the 
context  and  trace  variances;  therefore  we  shall  ignore  this  term 
in  the  remainder  of  our  analysis.  The  relative  size  of  the  trace 
and  context  variance  components  will  depend  on  the  magnitude  and 
type  of  variability  present  in  the  sequence  temporal  envelopes. 

In  general,  we  would  expect  that  the  magnitude  of  the  trace 
variance  would  be  high  in  the  uncorrelated  condition  relative  to 
the  correlated  condition,  and  therefore  context  mode  processing 
would  be  dominant  in  the  uncorrelated  condition. 

We  assume  that  the  variance  of  the  trace  component  will 
Increase  with  the  presence  of  any  differences  between  the  two 
traces  that  are  Irrelevant  to  the  frequency  discrimination  task. 
These  differences  may  be  produced  by  two  main  factors:  (a)  the 
time  period  during  which  the  trace  of  the  first  sequence  must  be 
held  in  order  to  make  a  comparison  with  the  trace  of  the  second 
sequence,  and  (b)  differences  between  the  sequence  trace:;,  such 
as  caused  by  temporal  variability,  that  are  irrelevant  to  the 
discrimination  task.  These  variables  are  summarized  in  the 
following  equation: 
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t 

l  ri  the  equation,  c'2  and  c  3  are  constants  which  apply  to  the 
storage  time  and  temporal  variability  components,  respectively. 
The  first  term  specifies  the  mean  duration  of  the  interval  from 
the  beginning  of  one  stimulus  sequence  to  the  beginning  of  the 
second  sequence;  ISI  is  the  i nte r -sequence - inter va 1 ,  n  is  the 


number  of  tones  in  each  sequence,  d  is  the  average  duration  of 
each  tone,  and  g  Is  the  average  duration  of  the  gaps  between  the 
tones.  The  magnitude  of  the  second  term  depends  on  some  measure 
of  the  correlation,  r,  between  the  temporal  envelopes  of  the  two 

sequences . 

In  the  correlated  conditions,  r  is  approximately  unity;  in 
the  uncorrelated  conditions,  r  i3  a  function  of  the  variability 
of  the  temporal  gap  between  tones,  'y ,  I  lie  variability  of  the 
tone  duration,  and  the  task  parameters  n,  d,  and  g.  Because 

we  do  not  have  a  closed  form  expression  for  the  correlation  of 
the  sequence  temporal  envelopes  as  a  function  of  C  y,  c ,  g,  n, 
and  d,  we  use  the  expression  <-t(  Cq/ g)2  •  an  approximation  to 
the  righthand  term  of  equation  2,  in  the  uncorrelated  condition. 
This  assumption  yields  the  following  expressions  f or  performance 
in  the  sequence  tasks: 

Performance  in  the  uncorrelated  condition, 
d • ( unc  )  = 

-2  2-11/2 
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and  performance  in  the  correlaced  condition, 
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The  context  variance  is  assumed  to  depend  on  the  complexity 
of  the  encoding  operation  required.  Thus,  the  context  variance 
is  assumed  to  increase  with  the  number  of  different  frequency 
patterns  that  are  possible  during  a  Mock  of  trials  of  the 
sequence  experiment.  This  is  a  slight  departure  from  the 
approach  taken  by  Durlach  and  Braida  (1969)  in  which  they  assumed 
that  the  context  variance  is  a  function  of  the  total  range  of  the 
stimulus  set.  Task  factors  which  do  not  increase  the  encoding 
requirement  should  not  affect  t  he  context  variance.  Thus,  we 
assume  that  the  context  variance  will  not  be  a  function  of 
variability  in  parameters  related  to  the  sequence  temporal 
envelope,  such  as  C  q  and  /q .  Of  course  one  could  argue  that 
increasing  the  number  of  possible  inputs,  such  as  by  adding 
temporal  variability,  will  Increase,  the  difficulty  of  the 
encoding  operation,  and  hence  will  increase  context  variance. 

Our  preliminary  assumption  13  that  these  factors  will  have  a  much 
greater  influence  on  the  trace  comparison  process  than  on  the 
context  encoding  process. 


3.  SEQUENCE  DISCRIMINATION  EXPERIMENTS 


3.1  General  Method 


we  conducted  experimental  tests  or  several  aspects  of 
sequence  discrimination  performance  described  by  the  model. 

Table  I  summarizes  the  conditions  and  variables  in  our 
experiments.  The  major  focus  in  these  experiments  was  on  the 
different  pattern  of  performance  in  the  correlated  and 
uncorrelated  conditions.  In  experiments  1,  2a,  and  3a,  the 
independent  variable  was  the  magnitude  ol  the  variability  of  the 
lnter-tone  gaps.  These  three  experiments  provide  a  picture  of 
how  performance  depends  on  gap  variability  in  the  correlated  and 
uncorrelated  conditions.  In  experiment  2b,  the  independent 
variable  was  the  mean  gap  between  tones.  This  experiment  allows 
performance  in  the  correlated  and  uncorrelated  conditions  to  be 
compared  at  different  gap  Intervals  and  a  constant,  gap 
variability.  In  experiments  2c  and  3b,  the  independent  variables 
were  tne  number  of  tones  in  each  sequence  and  the  length  of  the 
inter -sequence  interval.  These  experiments  evaluate  how 
performance  in  the  correlated  and  uncorrelated  conditions  changes 
as  the  number  ot  tones  and  inter -sequence -- i  liter  va  1  are  increased. 

These  experiments  employed  different  procedures  for 
generating  the  set  of  frequency  patterns  and  tor  generating  the 
frequency  patterns  on  "different"  trials.  The  procedures  were 
developed  over  a  period  of  time  using  different,  groups  of 
observers.  The  major  procedural  differences,  between  experiments 
1  and  2  and  experiment  3  wer*'  (a)  an  increase  in  the  complexity 
and  number  of  possible  frequency  patterns  and  (b)  the  addition  ot 
variability  to  the  tone  durations.  The  motivation  for  increasing 
the  complexity  ..f  the  sequence  patterns  in  experiment  3  was  to 
make  It  more  difficult  for  observers  to  learn  the  set  ot  patterns 
used.  In  our  early  experiments  we  had  observed  that  the 
dependence  of  performance  on  I  id  I  (in  the  correlated  condition) 
tended  to  be  attenuated  with  extensive  practice  on  the  task.  We 
believed  that  extensive  practice  with  a  small  set  ot  patterns 
could  result  in  t  tie  cont  ext  coding  process  becoming  dominant  i  n 
ttiat  condition.  Var  1  ab  i  1  i  t  y  to  the  tone  durations  was  added  in 
order  to  test  the  effect  ot  synchronizing  the  tone  onsets  on  the 
variability  man i pu lat ion .  These  changes  resulted  in  a  large 
increase  in  the  difficulty  oi  t.he  discrimination  task.  In  order 
to  bring  observer  performance  into  the  same  range  us  the  other 
experiments,  it  was  necessary  to  change  t.he  rule  for  generating 
patterns  on  "different"  trials.  The  specific  procedures  relating 
to  each  experiment  are  described  further  below. 

Three  groups  of  subjects  consisting  of  three  subjects  per 
group  participated  in  the  experiments.  Thus,  one  group  of 
subjects  served  in  experiment  1;  a  second  group  of  subjects 
served  in  experiments  2a,  b,  and  c;  and  a'  third  group  served  in 
experiments  3a  and  b.  The  subjects  were  undergraduate  students 
paid  an  hourly  rate  plus  a  fractional  incentive  bonus  tor  correct 
responses.  Ail  had  normal  hearing  and  extensive  experience  with 
psychoacoustl c  tasks.  They  performed  the  task  for  approximately 
two  hours  per  day,  three  days  per  week.  Subjects  were  seated  lu 
a  double-walled  Industrial  Acoustics  chamber.  Stimuli  were 
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presented  to  the  subject's  right  ear  over  TDH-39  headphones.  All 
signals  were  presented  at  71  dB  SPL . 


TABLE  I.  Summary  of  variables  and  conditions  for  the  sequent  (‘discrimination  experiments  Ejcli  experiment  (1.2.3)  employed  a  different  jjroupof  three 
subjects.  ( Frequencies  are  in  Hertz,  durations  in  milliseconds. ) 
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The  experimental  task  was  a  same/d i f Cerent  task:  subjects 
were  presented  with  a  pair  of  tonal  sequences  and  had  to 
determine  if  the  frequency  pattern  in  the  two  sequences  was  the 
same  or  different.  Subjects  answered  via  a  set  of  two 
(same/different)  pushbuttons.  Subjects  were  instructed  to  ignore 
variations  In  the  temporal  structure  of  the  sequences,  such  as 
jitter  in  the  tone  onsets,  durations,  or  inter-tone  gaps. 

Feedback  about  the  correct  response  on  each  trial  was  provided  to 
the  subjects.  All  tone  bursts  were  sinusoidal  segments.  Tones 
were  synthesized  via  12  bit  (experiment  3)  or  13  bit  (experiment 
1  and  2)  digital  to  analog  converters  at  a  sampling  rate  of  20 
KHz  and  low-pass  filtered  at  SKH7. .  A  symmetric  2  ms  linear 
envelope  was  used  as  a  gating  function.  Temporal  variability  was 
implemented  by  a  subroutine  that  added  positive  or  negative  time 
increments  to  the  inter-tone  gaps  according  to  a  Gaussian 
approximation.  The  maximum  3ize  of  an  increment  or  decrement  was 
2.5  times  the  gap  standard  deviation;  negative  gaps  were  not 
allowed . 

Subjects  were  trained  on  each  of  the  types  of  temporal 
matching  conditions  until  consistent  performance  was  obtained. 
Daily  testing  sessions  consisted  of  from  5  to  7  blocks  of  100 
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trials.  The  experimental  conditions  were  fixed  within  each 
block;  only  one  temporal  matching  condition  or  mean  gap  duration 
was  run  in  a  given  session.  The  yap  standard  deviation  was 
changed  randomly  from  block  to  block  within  each  session. 

3.2  Experiment  1 

3.2.1  Procedure 


The  eight  tone  binary  sequences  were  constructed  by  sampling 
from  tone  frequencies  of  669  and  1621  Hz.  The  first  sequence  on 
a  trial  was  created  by  sampling  from  a  set  of  54  different  binary 
patterns.  These  54  sequences  were  formed  by  starting  with  the  set 
of  all  sequences  of  8  tones  which  contain  equal  numbers  of  high 
and  low  tones,  and  then  arbitrarily  deleting  all  sequences  which 
had  regular  appearing  patterns,  such  as  HLHLHLHL .  On  same 
trials,  the  second  sequence  generated  was  the  same  pattern  as  the 
first.  On  different  trials,  two  adjacent  (different  frequency) 
tones  In  the  sequence  were  transposed  in  position.  Neither  the 
first  pair  or  the  last  pair  of  tones  in  the  sequence  were  subject 
to  this  manipulation .  Figure  1  illustrates  some  sample 
sequence  pairs  on  typical  trials  of  experiment  1.  Subjects  were 
run  on  at  least  300  trials  per  condition. 

3.2.2  Results 


The  average  performance  of  the  three  subjects  as  a  function 
of  the  gap  standard  deviation  is  shown  in  figure  2.  The  data 
from  individual  subjects  had  essentially  the  same  form  as  that 
shown  in  the  figure.  The  standard  error  of  data  points  from 
individual  subjects  was  typically  0.15  or  less.  Performance  in 
the  correlated  condition  was  independent  of  the  jitter  magnitude. 
Performance  in  the  uncorrelated  condition,  however,  dropped  well 
over  one  d'  unit  as  the  gap  standard  deviation  increased. 


o 


ft 


.) 


l*l< »  2  Average  performance  (plotted  points;  is  a  function  of  t)u  gapsf.m 
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100  tils)  Solid  lines  ate  theon  >u.d  fum  tarns  (see  text ) 
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3.3  Experiment  2 

3.3.1  Procedure 

Experiment  2  was  similar  to  experiment  1,  except:  in  the 
manner  of  choosing  the  frequency  patterns  and  in  the  employment 
of  a  new  group  of  three  subjects.  The  sequences  were  again 
binary  tonal  sequences  of  frequencies  669  and  1621  Hz,  with  an 
equal  number  of  high  (H)  and  low  (L)  tone:;.  On  each  trial  the 
first  tonal  sequence  was  selected  randomly  from  a  previously 
defined  set  of  16  different  sequence  patterns  composed  of  8  tones 
each.  On  same  trials,  the  first  sequence  was  presented  twice 
during  the  trial.  On  different  trials,  a  second  sequence  was 
sampled  from  the  set.  Tonal  sequences  composed  of  12  or  16  tones 
were  created  by  adding  a  four  tone  prefix  or  a  four  tone  prefix 
plus  a  four  tone  suffix  to  the  basic  8  tone  sot.  These  four  tone 
additions  were  random  sequences  subject  to  the  equal  H  and  L 
constraint . 

In  experiment  2a,  performance  in  the  uncorrelated  condition 
wa3  evaluated  at  a  mean  gap  duration  of  SO  ms  and  gap  standard 
deviations  of  10,  20,  and  40  ins;  performance  in  the  correlated 
condition  was  assessed  only  at  a  gap  standard  deviation  of  40  ms. 
In  experiment  2b,  performance  was  evaluated  in  the  correlated  and 
uncorrelated  conditions  at  a  fixed  gap  standard  deviation  of  40 
ms,  and  at  mean  gap  durations  of  SO,  100,  ISO,  and  200  ms.  In 
experiment  2c,  performance  was  evaluated  in  the  correlated  and 
uncorrelated  conditions  at  a  mean  gap  duration  of  SO  ms  and  a  gap 
standard  deviation  of  40  ms,  for  sequences  composed  of  8,  12,  and 
16  tones.  Subjects  ran  300  tiials  per  condition  of  experiment 
2a,  and  200  trials  in  each  condition  of  experiments  2b  and  2c. 

3.3.2  Results 

Figure  3  illustrates  the  averaged  results  of  experiment  2a. 
The  pattern  of  results  was  consistent  with  that  of  experiment  1 
shown  in  figure  2;  the  standard  error  of  individual  subject  data 
points  was  less  than  0.2.  Figure  4  is  a  plot  of  the  average 
performance  of  the  three  subjects  in  experiment  2b.  Performance 
is  plotted  as  a  function  of  the  mean  gap  duration,  g,  at  a  gap 
standard  deviation  of  40  ms.  Performance  of  the  individual 
subjects  was  similar  to  the  aveiaged  data  shown.  Performance  In 
the  correlated  condition  decreased  with  increases  in  mean  gap 
duration.  Performance  in  the  uncorrelated  condition  was  quite 
poor  at  short  gap  durations.  As  the  mean  gap  duration  increased, 
performance  In  the  correlated  and  uncorreiated  conditions 
converged.  Figure  5  illustrates  the  results  of  experiment  2c; 
this  shows  the  effect  of  the  number  of  tones  in  each  sequence  on 
performance  in  the  two  temporal  correlation  conditions. 
Performance  in  both  conditions  dropped  as  n  Increased  from  8  to 
16;  the  largest  drop  occurred  in  the  correlated  condition. 
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3.4  Experiment  3 
3.4.1  Procedure 


In  experiment  3a  an  additional  temporal  condition,  the 
synchronized  condition,  was  added  to  the  correlated  and 
uncorrelated  conditions  of  experiment  1  and  2.  In  order  to  make 
the  context  coding  component  of  the  task  more  demanding,  the 
number  of  possible  frequencies  was  increased  from  2  to  4,  and  the 
number  of  different  patterns  (on  same  trials)  from  16  to  2520. 
Figure  6  illustrates  some  tonal  sequences  possible  on 
experimental  trials  from  the  three  types  of  temporal  variability 
conditions . 
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In  the  correlated  condition  (c)  shown  In  figure  6,  tin- 
pattern  of  tone  onsets,  durations,  and  inter  tone  gaps  is 
Identical  in  the  two  sequences.  Note  that  for  this  particular 
pair  of  sequences  the  frequency  pattern  is  different  In  the  two 
sequences,  so  that  the  correct  response  is  "different".  The 
second  pair  of  sequences  illustrates  the  uncorrelated  condition 
(U).  Here  the  tone  onsets,  durations,  and  gaps  are  uncorrelated 
between  the  two  sequences.  The  frequency  pattern  is  identical  on 
the  trial  shown,  hence  the  correct  response  is  "same".  The  last 
pair  of  sequences  illustrates  the  synchronized  condition  (S).  In 
this  condition  the  tone  onsets  are  Identical  in  the  two 
sequences,  but  the  durations  and  gaps  are  varied.  In  this 
particular  pair  of  sequences  the  frequency  pattern  is  identical, 
so  the  correct  response  is  "same". 

The  tonal  sequences  were  composed  of  eight  tones;  each  tone 
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waa  selected  from  a  set  of  tout  frequencies:  St>0,  yuy,  1667,  and 
2857  Hz.  The  frequency  pattern  of  the  first  sequence  of  each 
trial  was  randomly  determined  but  subject  to  the  constraint  that 
each  frequency  appeared  twice,  on  different  trials,  three  of  the 
tone  positions  of  the  sequence  were  randomly  selected  to  be 
changed  from  their  original  frequency  to  one  of  the  three 
remaining  frequencies.  The  first  and  last  tone  position  were 
excluded  from  this  selection. 

Manipulation  of  the  temporal  character  ist lcs  of  the  two 
sequences  was  accomplished  In  a  manner  different  from  the 
previous  experiments.  The  tone  durations  were  randomly  selected 
from  a  set  of  five  durations  (20,  30,  40,  50,  and  60  ms)  having  a 
mean  duration  of  40  ms  and  a  standard  deviation  of  12.25  ms.  In 
Experiment  3a.,  same/dlf fererit  performance  was  evaluated  In  the 
same  manner  as  in  the  previous  experiments  at  a  mean  gap  duration 
of  50  ms  and  at  gap  standard  deviations  of  5,  10,  20,  30,  or  40 
ms  (omitting  the  5  ms  correlated  condition).  In  experiment  3b., 
performance  was  assessed  at  a  mean  gap  dur.it  Ion  of  50  ms,  at  a 
gap  standard  deviation  of  30  ms,  and  at  inter -sequence-intervals 
(ISIS)  of  500,  1500,  and  3000  ms,  U3ing  a  rating  response  instead 
of  the  two-response  same/different  task. 
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3.4.2  Results 


The  average  performance  of  three  subjects  In  experiment  3a 
is  Illustrated  in  Figure  7,  which  shows  the  effect  of  gap 
variability  on  the  discr iminabi 1 i ty  of  the  stimulus  sequences  in 
the  three  conditions  run,  at  a  mean  gap  of  50  ms.  Individual 
subject  data  was  based  on  at  least  200  trials  per  point.  The 
standard  error  of  Individual  data  points  was  less  than  0.2. 
Performance  in  the  correlated  condition  was  essentially 
independent  of  gap  variability.  Performance  in  the  synchronized 
condition  Improved  slightly  at  high  jap  variabilities. 

Performance  In  the  uncorrelated  condition  dropped  approximately  1 
d'  unit  and  then  leveled  off. 

The  results  of  Experiment  3b  are  shown  In  Figure  8,  which  is 
a  plot  of  performance  at  a  gap  standard  deviation  of  30  ms,  as  a 
function  of  the  inter-sequence- interval .  Performance  in  the 
correlated  and  synchronized  conditions  decreased  with  increased 
ISI.  Performance  in  the  uncorrelated  condition  was  independent 
of  ISI. 

3.5  Estimation  of  Model  Parameters 

In  order  to  evaluate  the  relationship  between  the  model 
predictions  and  the  data,  equations  3a  and  3b  were  simplified  as 
follows : 

d ' ( unc )  = 

{  A  +  I  B  ( I S I  4  nd  f  (n-l)g)  4  C(  ^q/q)2  1  1  )1/2  (4a) 

and 

d'(cor)  =  {  A  4  (  B ( I S I  +  nd  4  (n-l)g))"1  }2/2  (4b) 

where  A,  B,  and  C  are  constant?  which  Incorporate  the  model 
parameters : 


2  2  2  2 
A  =  c  /  ['  ,  B  -  c  /  c  ,  and  C  =  c  /  c 

1  c  2  1  3  1 


Equation  4a  was  fit  separately  to  the  data  of  experiments  1,  2a, 
and  3a,  U3lng  the  Gauss-Newton  method.  We  assumed  that  at  a  gap 
standard  deviation  of  zero,  performance  in  the  uncorrelated 
condition  wa3  equal  to  average  performance  in  the  correlated 
condition.  Thus,  the  three  parameters  were  fit  to  5,  4,  and  6 
data  points  in  experiments  l,  2a  and  3a,  respectively.  Table  II 
summarizes  the  parameter  values  computed  by  this  procedure. 

The  theoretical  curves  generated  with  these  parameters  are 
shown  in  figures  2,  3,  and  7.  The  parameter  values  derived  from 
the  data  of  experiment  2a  were  then  used  to  generate  the 
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theoretical  curves  shown  In  figures  4  and  5,  and  In  similar 
fashion  the  parameter  values  derived  from  experiment  3a  were  used 
to  generate  the  theoretical  curves  shown  In  figure  8.  The 
theoretical  curve  for  the  uncorrelated  condition  In  figure  5 
actually  decreases  slightly  as  a  function  of  n,  as  does  the 
theoretical  curve  for  the  uncorrelated  condition  shown  in  figure 
8. 


Table  II.  Parameter  values  for  Simplified  Model  (Equations  4a, b) 


Exper iment 

Parameter 

A 

B 

C 

1 

1. 

155 

0 . 000071 

0.618 

2a 

1. 

069 

0 . 0001  31 

5.930 

3a 

0. 

724 

0.000155 

34.05 

Table  III. 

Model  Parameter  Values  (assume  C2  =  0.001) 

Experiment 

Parameter 

2 

r 

c 

c 

1 

c 

3 

1 

12 

.  19 

3 . 75 

8.70 

2a 

7 

.  1  4 

2.76 

45.27 

3a 

0 

.91 

2 . 54 

219.69 

Because  the  parameter  C2  describes  a  general  property  of  the 
trace  decay  process,  we  expected  it  to  be  constant  over  the 
different  experiments.  Assuming  a  particular  value  for  C2 
enables  calculation  of  the  remaining  parameter  values  for 
purposes  of  comparison.  Table  III  summarizes  the  model 
parameters  which  result  from  the  assumption  that  C2  is  equal  to 
0.001  in  experiments  1,  2a,  and  3a.  It  is  apparent  that  the 
computed  value  of  the  context  coding  variance,  was  highest 

in  experiment  1,  although  the  values  do  not  differ  greatly  over 
the  three  experiments.  Because  the  size  and  nature  of  the 
pattern  set  varied  across  the  three  experiments  (see  Table  I), 

Cq  should  not  have  been  constant  across  experiments  1,  2a,  and  3a. 
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However ,  the  largest  value  of  t  c  was  obtained  in  experiment  1, 
contrary  to  our  expectation  that  the  size  of  the  pattern  set 
would  be  the  main  determinant  of  context  coding  variance.  it 
may  be  easier  for  an  observer  to  encode  patterns  from  the  larger, 
four -frequency,  pattern  set  of  experiment  3a  than  from  the 
smaller,  two-frequency,  set  of  experiment  1.  It  is  also  evident 
from  Table  III  that  the  values  of  parameter  c 3  were  approxi mately 
equal  in  experiments  1,  2a,  and  3a. 

Finally,  it  is  clear  from  Table  ill  that  the  value  of  C3 
varied  widely  across  the  three  experiments.  we  used  the 
expression  g/gl^  as  an  estimate  of  the  effect  of  the  envelope 
correlation  on  the  trace  variance,  and  ignored  the  influence  of 
other  variables  such  as  tone  duration  and  number  of  tones.  The 
large  variation  in  the  obtained  values  of  C3  indicates  that  the 
c3trg/g]2  expression  provides  an  inaccurate  estimate  of  the 
effect  of  envelope  variability. 


In  order  to  Improve  our  estimate  of  this  variance  component, 
we  examined  the  magnitude  of  the  correlation  between  the  temporal 
envelopes  of  sequences  in  our  experimental  conditions,  using 
computations  made  on  data  files  of  the  actual  stimulus  sequences. 
This  correlation  was  equal  to  unity  in  the  correlated  conditions 
of  all  experiments.  In  the  synchronized  condition  of  experiment 
3a,  the  envelope  correlation  was  essentially  constant  at  a  value 
of  0.7,  for  all  values  of  (rg.  in  the  uncorrelated  conditions  of 
experiments  la  and  2a,  the  envelope  correlation  was  initially 
equal  to  unity  (at  g  =  0  )  and  then  dropped  rapidly  to  a  value 
of  0.1  at  a  gap  variability  of  20  ms.  In  the  uncorrelated 
condition  of  experiment  3a,  the  presence  of  additional 
variability  in  the  tone  durations  caused  the  initial 
correlation  (at  [r  g  =  0)  to  be  approximately  0.16.  The 
correlation  then  fell  to  less  than  0.1  at  a  gap  variability  of 
5  ms .  We  attempted  to  fit  equation  3a  to  the  data  of 
experiments  1,  2a,  and  3a,  using  an  approximation  to  the  envelope 
correlation  function  in  place  of  {  ^g/gl2).  This  procedure 
resulted  in  relatively  poor  fits  to  the  data  because  of  the 
flatness  of  the  function  relating  envelope  correlation  to  gap 
var lability. 


3.6  DISCUSSION 


The  experimental  results  provided  qualitative  support  for 
the  main  assumptions  of  the  discrimination  model.  in  particular, 
the  results  are  consistent  with  the  idea  that,  trace  mode 
processing  is  dominant  in  the  correlated  conditions.  Performance 
was  Independent  of  gap  variability  in  the  correlated  condition, 
but  decreased  rapidly  with  gap  variability  in  the  uncorrelated 
condition  (Figures  2,  3,  and  7).  It  follows  from  an  assumption 
of  trace  mode  processing  that  performance  should  decrease  with 
Increases  in  the  time  Interval  that  the  trace  must  be  held  in 
storage,  such  as  the  time  between  the  onset  of  the  first  sequence 
and  the  onset  of  the  second  sequence.  That  prediction  was 
confirmed  in  experiment  3b  (figure  8)  which  showed  that 
performance  in  the  correlated  condition  decreased  as  a  function 


of  the  Inter -sequence- Interval .  Performance  In  the  uncorrelated 
condition  dropped  very  little  as  the  inter-sequence-interval 
increased.  The  prediction  was  also  confirmed  In  experiment  2b  by 
the  drop  In  correlated  condition  performance  when  the  average 
duration  of  the  gap  interval  was  Increased.  Increasing  the  gap 
duration  from  50  ms  to  200  m3  caused  an  increase  in  the  required 
storage  period  of  1650  ms  (11  x  150)  and  resulted  in  a  drop  in 
performance  of  about  1  d'  unit.  Performance  in  the  uncorrelated 
condition  Increased  during  this  manipulation. 

Performance  was  consistently  lower  in  the  uncorrelated 
conditions,  and  decreased  with  the  addition  of  temporal 
variability  to  the  sequence  envelopes,  as  shown  in  figures  2,  3, 

7  and  8.  We  attribute  these  decreases  to  increased  noise  in  the 

trace  process  produced  by  the  presence  of  variability  in  the 

sequence  traces  in  the  uncorrelated  condition.  As  the  magnitude 

of  the  sequence  variability  increases  in  the  uncorrelated  J 

condition,  the  variance  of  the  trace  process  increases  rapidly, 

and  eventually  context  mode  processing  dominates.  At  that  point, 

task  variables  which  affect  the  context  variance,  such  as  the 

number  of  possible  different  patterns,  influence  performance. 

Performance  in  both  the  correlated  and  uncorrelated 
conditions  of  experiment  2c  decreased  with  Increases  In  the 
number  of  tonal  elements  as  shown  in  figure  5.  The  largest  drop 
was  observed  in  the  correlated  condition.  The  small  drop  in 
performance  in  the  uncorrelated  condition  may  not  be  surprising 
since  one  would  expect  that  context  coding  variance  would 
increase  relatively  slowly,  perhaps  as  a  logarithmic  function  of 
the  number  of  tonal  patterns.  Part  of  the  drop  in  performance  for 
the  correlated  condition  is  attributable  to  the  increase  in  the 
total  storage  period  that  is  produced  by  increasing  n  (adding  one 
tone  Increases  the  total  storage  time  by  80.5  ms,  the  duration  of 
one  tone  plus  one  gap).  The  very  large  performance  decrease  in 
the  16  tone  case  may  be  due  to  other  factors  such  as  an  upper 
limit  on  the  length  or  duration  of  the  trace. 

An  interesting  result  concerns  performance  in  the 
synchronized  condition  of  experiment  3.  Performance  in  this 
condition  resembles  that  of  the  correlated  condition  in  several 
respects:  It  does  not  decrease  with  increasing  temporal 
variability  (figure  7)  and  it  decreases  with  increases  in  the 
magnitude  of  the  Inter -sequence- interval  (figure  8).  The  former 
result  is  consistent  with  our  calculation  of  a  constant 
correlation  of  approximate ly  0.7  between  the  temporal  envelopes 
of  the  synchronized  sequences  of  experiment  3.  Lower  performance 
in  the  synchronized  condition  than  in  the  correlated  condition  is 
consistent,  with  the  Idea  that  there  is  additional  trace  variance 
due  to  the  temporal  variability  of  the  sequences. 

4.  SEQUENCE  COMPARISON  ALGORITHMS 

The  effects  of  gap  variance  and  correlation  described  in  the 
preceding  sections  of  thl3  report  are  entirely  consistent  with 
the  extension  of  the  two-process  model.  However,  we  were 
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Interested  In  pursuing  several  additional  questions  about  the 
processing  mechanisms  involved  In  performance  of  the  sequence 
comparison  task.  The  first  question  concerned  the  specific 
mechanism  underlying  the  comparison  operation.  As  a  preliminary 
approach  to  this  problem  we  evaluated  some  specific  sequence 
comparison  algorithms  in  the  context  of  this  experimental  task. 

An  important  aspect  of  this  approach  is  that  it  Is  not  necessary 
to  make  assumptions  about  observer  processing  strategy  or  memory 
mode  for  particular  conditions;  a  comparison  algorithm  13 
postulated  and  then  applied  without  modification  to  all  the 
experimental  conditions.  If  successful,  the  postulated 
comparator  would  exhibit  the  observed  sensitivity  to  the 
temporal  coherence  of  the  stimulus  sequences,  as  well  as  to  other 
aspects  of  the  sequence  context. 

Consider  the  general  problem  of  sequence  comparison:  A 
sequence,  a,  is  a  series  of  elements,  aj,  a2,  .  ..,  an  where  the 

elements  aj  are  drawn  from  some  defined  alphabet.  One  wishes  to 
compute  a  measure  of  the  difference  or  similarity  between 
sequence  a  and  b.  Traditional  methods  for  comparing  sequences 
Include  Euclidean  distance  metrics,  the  Hamming  distance  measure, 
aid  others. 

These  traditional  measures  sum  the  results  of  all 
comparisons  between  element  pairs,  aj  and  bj  .  In  the  case  of 
tone  burst  sequences,  this  measure  would  be  a  function  of  the 
frequency  difference  between  the  tones.  For  the  Euclidean 
distance  measure  the  difference,  dab,  between  sequence  a  and  b 
is  given  by: 

n 

dab  -  l  (a.  b  .  )  )  1  /  2 

i-1 

For  the  Hamming  distance  measure, 

n 

^ab  =  d j  where  dj  0  If  nj-bj 

1  =  1  d  j  - 1  If  a^bi 

Both  these  measures  sum  tin  results  o£  comparisons  made  on 
corresponding  elements  in  each  sequence.  When  the  sequences  are 
composed  of  different  numbers  of  elements,  or  when  there  Is  no 
correspondence  between  the  elements  of  each  sequence,  more 
sophisticated  measures  are  needed.  The  requirement  for  comparing 
sequences  in  which  there  are  non  corresponding  elements  is  a 
problem  common  to  many  sequence  comparison  situations.  For 
example,  a  deleted  or  added  element  may  upset  the  normal 
correspondence  between  aj  and  bj  at  position  1  1 n  the  sequence. 
Distortions  in  the  time  periods  between  the  sequence  elements  or 
in  the  duration  of  the  elements  also  may  occur.  These  are  the 
sort  of  sequence  transformations  to  be  export uu  in  natural 
stimuli  such  as  speech.  We  would  expect  that  the  human  pattern 
processing  mechanism  would  be  able  to  handle  these  types  of 
distortions  and  trans format lon3  without  undue  difficulty.  Many 
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applications  o£  sequence  comparison  algorithms  (including  speech) 
are  described  in  a  book  edited  by  Sankoff  and  Kruskal  (1583). 

4.1  Weighted  Average  Algorithm 

An  example  of  a  more  complex  distance  measure  is  one  derived 
from  the  weighted  average  of  comparisons  made  between  all  pairs 
of  elements,  a^,bj,  in  each  sequence.  Element  pairs  that  are 
offset  by  a  large  difference  in  their  respective  position  receive 
a  smaller  weight  in  the  comparison  measure.  The  weighted  average 
measure  which  we  employed  is  based  on  one  used  by  Bradley  and 
Bradley  (1983)  In  a  study  of  sequence  comparison  techniques 
applied  to  the  study  of  bird  songs.  Bradley  and  Bradley  compared 
two  different  types  of  sequence  comparison  algorithms:  a  weighted 
average  algorithm  and  an  optimal  matching  algorithm.  We  used 
modified  versions  of  those  algorithms  as  candidate  comparator 
mechanisms  for  our  tonal  sequence  task  with  human  subjects. 

The  weighted  average  algorithm  employed  by  Bradley  and 
Bradley  uses  the  average  of  all  Inter-element  distances  for  all 
pairs,  i,l,  of  elements  from  the  two  sequences.  Measures  on  the 
distance  between  each  pair  of  elements  are  weighted  by  the 
function,  Wij,  which  is  a  linear  function  of  the  distance  between 
positions  i  and  j.  Their  algorithm  is  defined  as  follows: 
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The  weighted  average  algorithm  we  employed  was  the  same  as 
the  Bradley  and  Bradley  procedure  except  that  we  allowed  the 
computation  of  negative  distances  between  sequences,  and  we 
employed  a  modified  weighting  function.  It  the  span  between 
element  position  1  and  j  exceeded  a  constant  (about  20'*,  of  the 
sequence  duration),  the  weight  was  set  equal  to  zero.  If  the 
span  was  less  than  or  equal  to  that  constant,  a  unit  weight  was 
applied.  The  distance  between  elements,  Djj,  was  defined  as  a 
function  of  the  tone  frequency.  The  lowest  tone  frequency  was 
assigned  the  value  1,  the  next  2,  etc.  Dh  was  then  defined  as 
two  times  the  absolute  difference  between  those  values.  If 
either,  but  not  both,  of  the  elements  wa3  a  silent  period,  D^-j 
was  set  equal  to  unity. 

4.2  Optimal  Matching  Algorithm 

The  second  algorithm  employed  by  Bradley  and  Bradley  was  an 
optimal  matching  procedure  based  on  the  concept  of  the 
Levenshteln  distance  metric  (Kruskal,  1983).  These  metrics  allow 
comparisons  of  sequences  which  have  suffered  insertion/deletion 
or  time  compression/expansion  transformations.  These  metrics 
evaluate  the  summed  total  cost  of  the  basic  operations  needed  to 
transform  one  sequence  into  another.  For  example,  the 
substitution  of  very  dissimilar  tones  would  add  more  cost  than 
would  the  substitution  of  similar  tones.  The  minimum  possible 
cost  for  the  entire  set  of  operations  then  defines  the  distance 
between  the  sequences.  Following  Bradley  and  Bradley  (19B3),  the 
algorithm  for  the  specific  optimal  matching  metric  we  used  is 
recursively  defined  as  follows: 
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and  m,n  are  the  lengths  of  sequence  a  and  b. 

A  constant  value  of  1.9  tot  used  for  the  Insert ion/deletion 
cost  of  a  silent  element,  and  a  constant  value  of  3.6  was  used 
for  the  cost  of  inserting  or  deleting  any  tone  element.  The  cost 
of  substituting  an  element  of  tone  frequency  i  for  an  element  of 
tone  frequency  j  was  set  equal  to  2.3  times  the  absolute 
difference  between  the  tone  numbers  (lowest  frequency  equals  tone 
number  1,  next  frequency  equals  2,  etc.,  as  described  for  the 
weighted  average  algorithm).  (A  Pascal  routine  is  available  from 
the  author  which  implements  the  optimal  matching  procedure  using 
a  dynamic-programming  technique,  rather  than  explicit  recursion.) 

4.3  Analysis  of  Algorithm  and  Human  Performance 

Each  algorithm  was  tested  on  transformed  versions  of  the 
stimulus  sequences  used  in  the  human  experiments.  The  computer 
representation  of  each  stimulus  sequence  wa3  sliced  into  a 
sequence  of  segments  10  ms  in  duration.  These  segments  then 
became  the  elements  of  the  sequences  subjected  to  the  comparison 
algorithms.  Both  of  the  algorithms  incorporate  parameters  which 
will  affect  the  resulting  discrimination  performance.  No  attempt 
was  made  to  evaluate  a  large  volume  of  the  parameter  performance 
space.  Values  of  the  parameters  were  tried  until  a  reasonable 
range  of  performance  was  obtained  for  the  stimulus  sequences 
employed  in  the  experiment.  The  goal  was  to  determine  whether 
some  of  the  general  features  of  the  observed  human  behavior  would 
be  matched  by  the  algorithm,  rather  than  to  compute  a  fit  of  the 
algorithm  to  the  human  data. 

The  performance  of  the  two  algorithms  tested  Is  shown  as  the 
solid  and  dashed  lines  in  Figure  9;  the  plotted  points  were 
obtained  from  human  subjects  performing  a  rating  task  version  of 
the  conditions  described  in  section  1.1.  The  standard  deviation 
of  the  algorithm’s  responses  was  different  on  same  and  different 
trials,  preventing  use  of  a  direct  d’  measure.  The  performance 
of  the  algorithms  was  evaluated  by  determining  the  area  under  the 
Receiver  Operating  Characteristic  (ROC)  curves  (based  on  the 
algorithm's  trial  by  trial  responses)  and  then  converting  to 
equivalent  yes/no  d'  values.  The  performance  of  both  algorithms 
was  perfect  in  the  correlated  condition,  and  is  not  shown.  In 
the  uncorrelated  condition,  the  performance  of  both  algorithms 
decreased  as  a  function  of  gap  variability. 

Figure  10  shows  representative  ROC  curves  (plotted  on  normal 
probability  axes)  obtained  for  both  algorithms  and  for  two  of  the 
subjects.  The  slope  of  the  normal  probability  ROC  functions 
depended  somewhat  on  the  algorithm  employed;  slopes  generally 
were  less  than  1.0  for  the  weighted  average  algorithm  and 
exceeded  1.0  for  the  warp  algorithm.  Slopes  for  the  human  ROC 
curves  were  consistently  near  1.0. 
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Figure  9.  Average  performance  a»  a  function  of  the  gap 
standard  deviation  and  condition  (C,S,U)  for  the 
human  subjects  (plotted  symbols)  and  for  the  two 
comparison  algorithms  (solid  and  dashed  lines) . 
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Figure  |C.  Receiver  Operating  Characteristic  (ROC) 

curves  for  human  observers  RAT  and  AMC  (plotted 
points)  and  best  linear  fits  to  the  algorithms 
(solid  and  dashed  lines). 
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The  correlation  between  human  and  algorithm  performance  was 
computed  on  the  trial  by  trial  responses  of  the  human  subjects 
(ratings)  and  the  algorithms  (computed  responses).  Because  the 
magnitude  of  such  correlations  depends  on  the  level  of 
performance,  the  most  revealing  correlations  are  on  the 
conditions  having  the  poorest  performance.  At  the  uncorrelated 
30  ms  or  40  ms  conditions,  the  correlations  obtained  between 
individual  subjects  and  either  of  the  algorithms  was  generally 
less  than  0.4.  Neither  algorithm  correlated  consistently  higher 
than  the  other.  Correlations  between  the  algorithms  or  between 
pairs  of  subjects  also  were  generally  less  than  0.4.  Apparently, 
the  subjects  and  the  algorithms  (for  the  particular  parameter 
values  chosen)  order  the  similarity  of  sequence  pairs  in 
Idiosyncratic  ways. 

Both  of  the  algorithms  tested  produced  performance  which  was 
similar  to  the  profile  of  human  performance  over  the  different 
experimental  conditions  studied.  However,  note  that  these 
comparison  algorithms  are  not  models  of  human  tone  pattern 
discrimination.  To  construct  such  a  model,  we  would  need  to 
postulate  Internal  noise  in  order  to  obtain  less  than  perfect 
model  performance  in  the  correlated  condition.  A  reasonable 
assumption  would  be  that  this  internal  noise  would  take  the  form 
of  uncorrelated  temporal  jitter  added  to  the  tone  onsets.  To 
estimate  the  magnitude  of  this  internal  jitter,  we  could  observe 
the  performance  of  the  algorithm  in  low  levels  of  uncorrelated 
jitter.  When  the  performance  of  the  algorithm  in  the 
uncorrelated  jitter  matched  that  of  the  human  in  the  correlated 
jitter,  an  estimate  of  the  Internal  jitter  could  be  obtained. 

This  application  of  sequence  comparison  procedures  was  not 
intended  to  serve  as  a  test  of  whether  these  specific  algorithms 
will  be  good  models  of  tonal  sequence  discrimination.  Rather, 
our  analysis  demonstrates  that  such  algorithms  may  be  a  useful 
starting  point  in  modelling  how  humans  process  serially  defined 
stimuli.  A  number  of  other  computat i ona 1  procedures  and 
variations  of  these  algorithms  could  have  been  tested.  For 
example,  optimal  matching  procedures  can  incorporate  the  costs  of 
changes  in  the  durations  of  the  tones  or  in  the  gaps  between  the 
tones.  Algorithms  of  this  type  have  been  developed  for  the 
machine  processing  of  speech.  Such  algorithms  could  be  evaluated 
in  tone  pattern  discrimination  or  in  other  tasks  Involving 
stimulus  sequences.  Other  investigators  have  reported  the 
effects  of  specific  types  of  sequence  context  manipulations  on 
the  salience  and  dlscr lmlnabl 1 i ty  of  subsequences  (see  Leek, 
1987).  It  would  be  Interesting  to  see  whether  the  performance  of 
specific  comparison  algorithms  would  resemble  the  human 
observer's  sensitivity  (or  lack  of  sensitivity)  to  such 
contextual  manipulations  and  to  the  kinds  of  transformations 
discbssed  in  the  hext  section. 

ouir  current  research  plan  is  to  evaluate  different  sequence 
comparison  algorithms  directly,  by  use  of  a  rhythm  discrimination 
paradigm.  Thl3  procedure  requires  the  observer  to  report  whether 
or  not  two  sequences  of  tones  have  the  same  pattern  of  durations 
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and  inter-tone  gaps,  e.g.  the  same  rhythm  of  tones.  On 
"different"  trials,  the  second  sequence  on  each  trial  has  a 
temporal  Jitter  Introduced  into  its  inter-tone  gaps.  Performance 
will  be  evaluated  as  a  function  of  the  magnitude  of  this  added 
inter-tone  gap  jitter;  we  expect  that  performance  will  increase 
as  a  function  of  this  parameter.  We  will  also  modify  the  second 
sequence  of  each  trial  by  uniformly  compressing  or  expanding  all 
sequence  gaps  and  durations.  This  manipulation  maintains  the 
rhythm  pattern  but  disturbs  the  temporal  correlation  between  the 
envelopes  of  the  two  sequences.  Our  interest  is  in  defining  how 
performance,  at  a  given  jitter  magnitude,  depends  on  the  amount 
of  compression  or  expansion.  Thus,  we  hope  to  use  this 
experimental  paradigm  to  discriminate  between  sequence  comparison 
mechanisms  which  require  correlation-like  processes  and  those, 
such  as  the  optimal  matching  algorithm,  which  do  not. 

5.  LINEAR  TRANSFORMATION  OF  SEQUENCE  PARAMETERS:  PROPERTIES  OF 
THE  SENSORY  TRACE 

An  Important  question  in  sequence  comparison  is  the 
sensitivity  of  the  trace  and  context  mechanisms  to  particular 
types  of  transformations  in  the  sequences  to  be  compared.  The 
context  mechanism  is  assumed  to  be  insensitive  to  the  length  of 
the  I  SI  and  to  the  effects  of  certain  events  occurring  during  the 
ISI  period.  The  trace  mechanism,  on  the  other  hand,  is  assumed 
to  be  highly  sensitive  to  the  duration  of  the  ISI  and  to  certain 
other  potentially  interfering  events,  such  as  the  occurrence  of 
similar  stimuli  during  the  ISI  period.  The  specific  effects  of 
other  types  of  transformations  of  the  sequences  are  of 
considerable  theoretical  Interest.  Linearly  scaling  the  duration 
of  the  initial  or  final  sequence,  for  example,  should  have  a 
different  effect  on  performance  while  in  the  two  modes.  Such  a 
scaling  may  or  may  not  Impose  additional  processing  demands  on 
either  mode. 

For  example,  one  interesting  question  concerns  whether 
duration  compression  or  expansion  will  produce  decrements  in  the 
trace  node  which  are  a  function  of  the  ISI.  That  Is,  will 
duration  scaling  Interact  with  the  increase  in  trace  variance 
over  time?  The  answer  depends  on  one's  concept  of  the  trace 
process.  If  the  trace  is  a  true  raw  representa ion  of  the  Input, 
then  compressing  or  expanding  the  sequence  should  degrade 
performance  In  a  fashion  similar  to  the  addition  of  temporal 
jitter  to  the  sequence  Intervals,  and  an  Interaction  will  result. 
If,  however,  the  trace  is  (effectively)  only  a  partially  encoded 
representation  of  the  tone  bursts  and  the  temporal  (rhythmic, 
etc.)  relationships  among  the  tone  bursts,  then  such 
transformations  may  have  a  quite  different  effect.  In  that  case, 
the  effect  of  such  a  transformation  would  not  produce 
interactions  with  ISI,  but  rather  would  resemble  the  effects  of 
addlhg  temporal  jitter  in  the  context  mode:  e.g.  produce  a  fixed 
decrement,  depending  on  the  magnitude  of  the  transformation. 

These  predictions  were  subject  to  experimental  test. 
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5.1  Duration  scaling  Experiment 

in  experiment  4,  we  modified  the  manner  In  which  the 
durations  of  the  tones  and  gaps  of  the  second  sequence  In  a  trial 
were  generated.  In  the  'normal'  duration  transformation 
condition,  the  second  sequence  in  a  trial  was  generated  without 
modification.  Trials  in  this  condition  were  of  the  same  type  as 
those  shown  earlier.  In  the  'expanded'  duration  transformation 
condition,  the  durations  of  all  tones  and  gaps  in  the  second 
sequence  of  the  trial  were  multiplied  by  a  factor  of  1.4, 
producing  a  sequence  40%  longer  than  It  would  have  been  without 
the  transformation. 

Duration  transformation  was  manipulated  factor lally  with 
temporal  correlation,  producing  the  'correlated-expanded' 
condition,  in  which  the  two  sequences  in  a  trial  had  the  same 
temporal  pattern  prior  to  expansion  of  the  second  sequence,  and 
the  ' uncorrelated-expanded '  condition,  in  which  the  temporal 
patterns  of  the  two  sequences  In  a  trial  were  different  prior  to 
expansion  of  the  second  sequence.  Typical  trials  from  each  of 
these  conditions  are  shown  in  figure  11. 


Figure  11.  Examples  of  pairs  of  tonal  sequences  in  the  expanded 
conditions  of  experiment  4. 
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We  were  interested  in  whether  operation  In  the  trace 
mode  requires  that  the  two  sequences  have  correlated  temporal 
envelopes.  Another  way  to  put  the  question  Is  to  ask  whether 
subjects  would  treat  expansion  o £  the  second  sequence  of  a  trial 
as  a  difference  in  the  sequence  temporal  patterns-  similar  to  that 
found  in  the  uncorrelated  condition-  or  if,  instead,  they  would 
be  able  to  scale  the  duration  of  the  sensory-trace  of  one 
sequence  to  match  that  of  the  other.  If  such  a  scaling  process 
were  possible,  then  one  would  expect  to  see  the  same  general 
pattern  of  results  in  the  correlated-expanded  condition  as  in  the 
correlated-normal  condition.  If  such  a  scaling  process  were  not 
possible,  then  the  pattern  of  performance  in  the  correlated- 
expanded  condition  should  closely  resemble  that  found  in  the 
uncorrelated-normal  condition. 

The  results  can  be  seen  in  figure  12  and  13.  Figure  12 
shows  performance  averaged  across  subjects  for  the  correlated 
conditions.  Performance  in  both  the  normal  and  expanded  cases 
was  independent  of  gap  standard  deviation,  but  decreased  with 
increasing  ISI.  Figure  13  shows  performance  in  the  uncorrelated 
conditions.  In  this  condition,  performance  was  Independent  of 
ISI  in  the  conditions  involving  large  differences  in  sequence 
temporal  patterns  within  a  trial  (i.e.  those  in  which  the  gap 
standard  deviation  was  40),  but  decreased  with  ISI  in  those 
conditions  which  did  not  involve  large  temporal  pattern 
differnces  (i.e.  those  in  which  the  gap  standard  deviation  was 
10).  Note  that  performance  in  both  the  uncorrelated-normal  and 
uncor related-expanded  conditions  decreased  with  increasing  gap 
standard  deviation.  The  results  from  the  'normal'  duration 
transformation  conditions  replicate  the  results  of  experiments  1 
through  3,  while  the  results  from  the  'expanded'  conditions 
support  the  duration-scaling  hypothesis  previously  proposed. 

While  performance  In  the  expanded  conditions  Is  poorer  than  that 
In  the  normal  conditions,  perhaps  due  to  trace -decay  during 
duration  scaling,  the  general  pattern  of  results  is  the  3ame, 
leading  one  to  believe  that  S's  were  able  to  compensate  for 
sequence  expansion  Ln  these  conditions,  perhaps  by  "scaling"  the 
sensory  traces. 

Another  Interesting  transformation  to  be  evaluated  is  that 
of  frequency.  One  particular  question  relates  to  frequency 
transformations  of  an  octave  or  less  (either  up  or  down). 

Because  of  the  octave  generalization  phenomenon,  we  would  expect 
that  the  trace  (or  "encoded"  trace,  if  our  duration 
transformation  results  are  confirmed)  would  be  insensitive  to 
octave  frequency  transformations,  but  possibly  not  to  other 
transformations-  Depending  on  the  outcome  of  these  experiments, 
we  will  modify  the  concept  of  the  trace  mechanism  in  the  two - 
process  dlscr imihat ion  model  to  incorporate  the  observed  results. 
Preliminary  data  Erom  a  frequency  transformation  experiment 
Indicated  that  observers  probably  cannot  scale  frequency  In  t lie 
idDM  Way  as  duration;  performance  as  a  function  of  the 
intersequence  interval  flattens  out  as  the  magnitude  of  the 
frequency  transformation  Is  Increased. 
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12.  Average  performance  as  a  function  of  1  rit  er  sequenc  < 
Interval,  gap  standard  deviation,  and  temporal 
trans-formation,  for  the  correlated  conditions  of 
experiment  4. 
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experiment  4. 


6.  PROCESSING  INFORMATION  FROM  TONAL  SEQUENCES  AND  VISUAL 

DISPLAYS 

Our  Interest  In  the  perception  of  auditory  tonal  sequences 
and  In  signal  detection  models  of  human  detection  and 
discrimination  led  us  to  study  some  problems  concerning  the 
display  of  information  via  the  auditory  or  visual  modalities. 

This  section  discusses  a  signal  detection  theory  based-method  for 
evaluating  visual  and  auditory  displays.  The  method  is  used  to 
study  to  how  a  subject  acquires  information  from  a  multiple 
element  display.  The  method  allows  assessment  of  the  processing 
characteristics  associated  with  individual  elements  of  the 
display.  Using  the  method,  it  is  possible  to  compare  the 
effectiveness  of  different  display  codes  and  of  different 
arrangements  of  display  elements.  This  summary  concentrates  on 
the  mechanics  of  the  method  and  summarizes  dctta  from  some 
experiments  with  visual  and  auditory  displays.  This  work  was 
performed  in  collaboration  with  Donald  E.  Robinson  and  Bruce  G. 
Berg  of  Indiana  University. 

The  basic  experimental  paradigm  is  a  signal  detection  task 
in  which  the  subject  must  decide  whether  event  A  or  event  B  led 
to  the  displayed  data.  Either  event  A  and  B  occurs  randomly  on  a 
trial.  The  display  consists  of  n  display  elements;  each  element 
conveys  information  about  the  trial  event  which  led  to  the 
displayed  data.  The  displayed  elements  are  generated  by  a 
statistical  process  whose  parameters  depend  on  which  event  (A  or 
B)  was  selected  on  that  trial.  For  example,  consider  a  display 
consisting  of  the  four,  two-digit,  numerical  elements: 

I  1.3  3.1  0.2  2.4  I 

These  two-digit  elements  are  generated  from  one  of  two 
normal  distributions:  distribution  "A",  having  a  mean  of  1.5  arid 
a  standard  deviation  of  1.2,  and  distribution  "B",  having  a  mean 
of  2.5  and  a  standard  deviation  of  1.2.  On  an  "A"  trial,  each  of 
the  four  display  elements  is  independently  generated  from  the  A 
distribution.  On  a  "B"  trial,  each  is  drawn  from  the  B 
distribution.  Given  the  subject's  hit  and  false  alarm  rates,  we 
can  then  compute  a  measure  of  his/her  performance  with  the  signal 
detection  measure,  d'.  For  a  display  containing  only  a  single 
element,  d'  for  an  optimal,  noiseless,  observer  would  be  equal  to 
the  difference  between  the  means  of  the  A  and  B  distributions, 
divided  by  the  common  standard  deviation,  that  i3,  d'=  0.83. 

It  should  be  evident  that  the  displayed  information  (the 
normal  deviates)  could  be  presented  to  the  subject  in  different 
ways;  in  analog  form  as  in  a  straight  line  or  square  array  of 
line  gauges,  or  in  an  auditory  display  in  which  the  information 
is  coded  by  the  frequency  of  tones  in  a  sequence  of  tones.  How 
would  an  Ideal  observer  perform  with  a  display  composed  of  n 
independent  display  elements?  It  can  be  shown  the  performance  of 
an  ideal  observer  Increases  with  the  square  root  of  the  number  of 
elements  In  the  display.  Hence,  in  the  four  element  case 
described  above,  ideal  performance  is  equal  to  a  d'  of  0.83  x  2 


I 


T" 


or  1.67. 

Naturally,  for  certain  display  formats  and  display 
durations,  we  would  expect  that  processing  limitations  would 
prevent  the  human  subject's  d'  from  increasing  at  the  same  rate 
as  the  ideal  observer.  And,  in  some  cases,  human  performance  may 
level  off  at  some  asymptotic  level;  passed  some  point,  increasing 
the  number  of  elements  would  not  result  in  increased  performance. 

0 

}  Figure  14  shows  data  obtained  in  auditory  experiments 

using  sequentially  presented  tones.  It  is  clear  that  the  subject 
fails  to  accumulate  information  at  the  Jn  rate.  The  auditory  data 
was  well  fit  by  a  model  which  incorporated  two  sources  of 
k  internal  noise:  one  associated  with  the  observation  of  each 

r  element  and  one  with  the  formation  of  the  decision  statistic. 
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Figure  14.  Average  performance  <d')  of  four  observers  to  a 

frequency-coded  sequential  auditory  stimulus  as  a 
function  of  the  number  of  tones  in  the  sequence. 
Solid  line:  square-root-of -n  prediction.  Dashed 
line:  prediction  of  a  partitioned  variance  model. 
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Figure  15.  Average  performance  (d1)  of  three  observers  to  a 
visual  display  as  a  function  of  the  number  of. 
elements.  Solid  line:  square:-  root-of-n  prediction. 


Figure  15  3hows  performance  .averaged  across  three 
subjects  In  a  visual  display  experiment  using  digital  and  analog 
(line  gauge)  display  elements  arranged  in  both  straight  line  and 
square  arrangements.  The  plotted  data  were  obtained  at  a  display 
duration  of  255  msec;  the  solid  line  is  ideal  (jn)  behavior.  At 
low  values  of  n,  the  human  data  are  very  close  to  the  ideal.  At 
higher  values  of  n,  performance  in  the  square  array  analog  and 
numerical  conditions  reaches  asymptotic  levels.  Data  obtained 
with  the  same  displays  at  a  duration  of  1000  msec  matched  the 
Ideal  functions  very  closely  at  all  values  of  n. 

The  asymptotic  nature  of  the  visual  display  performance,  at 
high  values  of  n,  13  consistent  with  the  idea  that  the  subject 
cannot  obtain  full  Information  from  all  the  display  positions,  at 
least  In  a  brief  stimulus  exposure.  An  important  question,  then. 
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Is  whether  the  subject  obtains  full  Information  from  a  small 
subset  of  display  elements  or  partial  information  from  every 
display  element.  If  the  subject  uses  partial  Information  from 
all  the  elements  to  make  a  decision,  one  might  ask  whether 
information  from  some  display  elements  weighs  more  heavily  than 
from  other  elements  In  determining  the  decision.  An  analytical 
procedure,  developed  by  D.  E.  Robinson  and  Bruce  Berg  of  Indiana 
University,  allows  some  resolution  of  these  important  questions 
about  subject  performance. 

The  amount  of  information  that  the  subject:  obtains  from 
different  elements  of  a  display  will  he  reflected  in  the 
variability  of  the  subject's  response  that  may  be  attributed  to 
each  display  element.  Recall  that  on  each  trial,  the  subject 

responds  "signal  A"  or  "signal  B " .  Thus,  one  can  compute  the  \ 

probability  that  the  subject  responds  "  as  a  function  of  the 

particular  values  taken  by  the  element  at  position  i  in  the 

display  array.  Such  a  plot  should  generate  the  cumulative 

probability  distribution  for  the  "g>"  response  given  the  value  of 

element  i,  as  shown  in  figure  16.  Suppose  that  this  function 

were  flat-  one  would  conclude  that  the  subject's  response  was 

Independent  of  the  value  of  display  element  i;  similarly,  if  the 

function  resembled  a  step  function,  we  could  perfectly  predict 

the  response  by  knowing  the  element  value.  The  point  Is  that, 

under  certain  assumptions,  the  variance  of  this  distribution, 

e.g.  the  slope  of  this  cumulative  probability  function,  provides 

an  estimate  of  the  contribution  of  element  i  to  the  subject's 

decision.  The  lower  the  variability,  the  higher  the  contribution 

of  that  element  to  the  subject's  decision. 

The  next  several  figures  illustrate  the  results  of  an 
analysis  of  this  sort  applied  to  the  visual  and  auditory  displays 
described  earlier.  Figure  17,  from  the  experiment  with  a 
sequentual  auditory  display,  shows  the  slope  of  the  element 
probability  distribution  as  a  function  of  temporal  position.  The 
parameter  on  the  figure  is  the  total  sequence  length,  n.  If  each 
element  n  the  sequence  contributed  equally  to  the  final 
decision,  the  lines  in  Figure  17  would  be  horizontal.  It  is 
clear  that  the  last  tone  in  a  sequence  contributes  more  to  the 
final  decision  than  do  tones  in  the  middle,  which  contribute  less 
than  those  near  the  beginning  of  the  sequence.  The  result  is 
similar  to  that  observed  in  experiments  on  the  recall  of  lists  of 
verbal  materials. 
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Figure  16.  Probability  of.  observer  ' s  "yes"  response  given  the 
particular  magnitudes  (over  trials)  of  the  display 
element  in  a  visual  detection  task  (two  different 
elements ) . 


Figure  17. 


Flope  of  the  Probability  vs.  Element  Magnitude 
curves  as  a  fun*  t Ion  of  clement  sequential  position 
(auditory  dlsplev  experiment).  Curves  are  given  for 
' equences  rompo:  of  6  tones  and  10  tones. 


Figures  16,  19,  20,  and  21  are  similar  plots  of  the  slopes 
of  the  element  probability  functions  for  nine  element  visual 
displays.  Figure  18  is  average  data  from  four  subjects  in  a 
visual  display  experiment  using  nine  digital  elements  arranged  in 
a  straight  line.  The  major  contribution  to  the  subjects' 
response  is  from  the  three  elements  in  the  center  of  the  display. 
Figure  19  shows  the  subjects'  performance  with  a  9  element  analog 
display  arranged  in  a  line.  Now,  it  appears  that  information 
from  many  elements  has  an  influence  on  the  response.  Figures  20 
and  21  show  similar  plots  of  data  for  9  element  digital  and 
analog  displays  arranged  in  a  square  spatial  pattern.  A  similar 
picture  emerges:  Information  is  accrued  from  most  of  the  elements 
in  the  analog  display,  but  from  only  a  few,  centrally  located  and 
spatially  adjacent  elements  in  a  digital  display.  (The  data  from 
individual  subjects  resembled  the  averaged  data  very  closely.) 

This  analytical  technique  may  enable  useful  comparisons  to 
be  made  between  different  types  and  arrangements  of  visual  and 
auditory  displays  and  may  provide  important  information  about  how 
human  observers  process  information  from  stimulus  displays. 
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